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Abstract 

Back-reaction of fields plays an important role in the generation of particle masses and the mass- 
energy equivalence of special relativity, but the most natural demonstrations using classical models 
result in apparent errors such as the notorious "4/3" problem. Here we discuss the resolution of 
these discrepancies within the underlying atomic description of matter, with the aim of encouraging 
classroom discussion of back-reaction and its connection to relativistic effects. 
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I. INTRODUCTION 



Equivalence between mass and energy is a cornerstone of special relativity, yet the formal 
derivations provide little physical intuition. Students can calculate quantitatively correct 
values for the masses of different configurations, e.g. a capacitor with varying plate separa- 
tion, but often gain little or no qualitative understanding of the mechanisms which produce 
the differences. 

One of the primary mechanisms has been understood, at least in principle, since the 
1881 observation of J.J. Thomson that a charged, accelerating sphere experiences a "back- 
reaction" force from its own electromagnetic (EM) field, which acts to resist the acceleration 
and effectively increases the mass of the sphere.- - - This effect, and its extension to other 
systems of charges such as capacitors or atomic bound states, provides the mechanism 
by which internal EM energy contributes to the overall mass of a system. The magnetic 
component of the EM back-reaction should already be familiar to most students as the 
phenomenon of self-induction. 

Different charge configurations lead to different back-reaction forces, which, along with 
internal kinetic energy such as electron orbital motion, accounts for most of the mass dif- 
ferential between different states of ordinary matter. Moreover, as has been emphasized by 
Wilczek,- back-reaction of the strong nuclear force accounts for most of the mass of protons 
and neutrons, only a small fraction of which comes from the intrinsic (or Higgs-derived) 
quark masses. Students who have studied self-induction therefore understand, in essence, 
one of the primary sources of mass in the universe, as well as one of the principal mechanisms 
of mass/energy equivalence, but they may not be aware of these connections. 

Given the centrality of this mechanism it would be beneficial to expose students to it, for 
example by computing back-reaction in simple classical configurations and showing its effect 
on mass. However, a vexing problem arises: computations of this sort do not give correct 
answers. One expects to find that the mass contributed by back-reaction equals the stored 
EM energy divided by c 2 , as predicted by Einstein's formula, but model computations give 
answers differing from this by geometry-dependent factors, most notoriously the factor 4/3 
for spherically symmetric cases; indeed, the problem has come to be known as the "4/3 
problem" .- 

An explanation for such discrepancies was proposed long ago by Poincare, who noted that 
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the charge distributions appearing in classical models are unstable and require additional 
forces to stabilize them.- Adding in the energy and momentum contributions from these 
"Poincare stresses" compensates the discrepancy and allows the overall momentum and 
energy to transform properly 4- vector. b 

Poincare stresses do resolve the problem, but when treated generically they leave the 
mechanism rather obscure, and the focus on additional forces is confusing since everyday 
matter is, in fact, constructed from EM forces and charged particles. Here we clarify how 
the problem is resolved within realistic atomic matter, with the aim of encouraging class- 
room discussion of back-reaction examples. The 4/3 problem has, of course, been discussed 
many times previously, but most often in the context of constructing classical models of the 
electron, which is not our present interest 

We view this as part of a broader effort to introduce more qualitative or constructive 
explanations into the teaching of special relativity, with the goal of improving students' 
intuition for relativistic effects and showing how those effects connect to prior elements 
of the physics curriculum.- - — No originality is claimed for the calculations to follow, and 
references have been cited when known. Natural units c = h = 1 will be used throughout, 
and "mostly-plus" metric signature (—,+,+, +).— Greek indices /x, u, . . . refer to 4-space 
coordinates, while . . . refer to 3-space, with repeated indices summed in both cases. 

II. MODELS OF BACK-REACTION AND RESULTING DISCREPANCIES 

Perhaps the simplest configuration in which to study EM back-reaction is the parallel- 
plate capacitor, FigJT] This also provides a model for bound states in general, since varying 
the separation of the plates changes the internal EM energy just as, e.g., for different atomic 
levels. 

We consider a capacitor having square plates with sides of length L parallel to the x, y 
axes, separated by distance h in the z direction. The top plate holds uniform surface charge 
density a > and the bottom plate —a, leading to the electric field E = —oz. 
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FIG. 1. Parallel-plate capacitor 



The question to be answered is how much of the mass of the capacitor is due to its EM 
field. According to Einstein's formula the answer should be 

AM = Hem (1) 

where Hem is the EM field energy, given by 

Hem = \J 'd 3 xE 2 . (2) 

One measures mass by trying to accelerate the system, i.e. by applying a force and 
measuring the total impulse required for the system to attain a given speed. Assuming that 
the external force doesn't interact with the internal EM field directly, the only way in which 
the internal EM field affects this measurement is through back-reaction onto the charges. 

We consider first accelerating the capacitor in the x direction, in which case the back- 
reaction takes the form of self-induction. As the charges accelerate, a magnetic field B 
develops between the plates, parallel to y. The growing B-field in turn creates an E-field 
which acts back on the charges, opposing the acceleration. Instead of following this process 
explicitly, we will calculate the total back-reaction impulse in a more general way which 
applies more easily to other configurations. 

The back-reaction is just force applied by the EM field, i.e., momentum exchanged be- 
tween the field and the matter. The total back-reaction impulse generated by an EM field 
between two time points is therefore the negative of the change in the EM field momentum. 
The EM field momentum is given by the Poynting vector 13 

Vem = J d 3 x (E x B) (3) 

and we note that the capacitor has Vem — when at rest. The cumulative back- reaction 
impulse from accelerating the capacitor to velocity /3 is then found by calculating Vem for 
the moving system, a calculation which is often quite easy because one can use a Lorentz 
transformation to find the EM field of the moving system. 
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Here, and in the remainder of the paper, we consider only small velocities, and work to 
first order in f3. First-order calculations suffice to determine the effective mass and exhibit 
the primary back-reaction mechanisms, while simplifying the calculations greatly. 

The expected EM momentum in light of Einstein's formula (EqJTJ) would be 

"Pem = PHem, (4) 

but the actual situation is not so simple. Applying the standard Lorentz transformation of 
the EM field, the boosted state acquires a magnetic field B = (3 x E, leading to momentum- 

Vem = J d 3 x(E x x E)) 

= J d 3 x 0E 2 — E0 ■ E)), (5) 

a formula which applies to any configuration having negligible initial B-field. 

For the capacitor accelerated along x, the second term vanishes, and by comparing with 
Eqs. (l2HI) one sees immediately that the result is too large by a factor of two! Hence, 
although the example clearly shows a qualitative effect of back-reaction on mass, it does not 
successfully reproduce the quantitative mass/energy relation. 

For acceleration in the z direction the situation is equally perplexing, for no B-field is 
generated and the final EM momentum is also zero.— It would appear that there is no back- 
reaction at all, hence no contribution from the EM energy to the mass of the system. One 
might say that the x case has a "factor of two" discrepancy, while the z case has a "factor of 
zero" discrepancy. In fact the discrepancy was discovered in studying spherically-symmetric 
geometries (models of the electron) and has long been referred to as the "4/3 problem", 
after the factor which is found for these cases. We note that, in addition to self-induction, 
one must usually also include electrostatic self-forces; however, Eq.Q remains valid. 

If the momentum stored in the EM field does not account for the relativistically-required 
contribution Eq.flU), the difference must be stored somewhere else, since the underlying the- 
ory of quantum electrodynamics does respect both momentum conservation and Lorentz 
invariance. The only place additional momentum could be stored is in the surrounding 
atomic material of the capacitor, hence there must be a compensating momentum discrep- 
ancy in the atomic matter; in other words, the total momentum of the electrons and nuclei in 
the moving atomic system must also differ from that expected by simply applying Einstein's 
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formula (the analog of EqJH for the matter subsystem) . In the following sections we show 
how this "matter momentum discrepancy" arises. 

For future reference we show the general form of the EM momentum discrepancy, obtained 
by subtracting Eq.fllJ) from Eq.(J3]): 



III. MOMENTUM DISCREPANCY IN THE MATTER SYSTEM 

If matter consisted of a static assembly of components then it would be hard to see how an 
object could harbor any momentum beyond that of its bulk motion; however, the modern 
view of matter involves a great deal of internal motion which can, in fact, develop a net 
momentum in relativistic theories. We consider first a classical atomic model consisting of 
a uniformly distributed ring of identical charged particles in circular orbits at fixed radius 
around a central charged nucleus. We neglect radiation, and assume that the total orbiting 
charge density is small enough that the charges have negligible effect on each other's motion. 
The force on a charge q located at position r relative to the nucleus is F = —q\E\f, where 
\E\ is constant. 

When the system (i.e., the nucleus) is at rest, the orbital momenta of the charges cancel 
and add nothing to the total momentum. In Newtonian physics this would also be true of 
the moving system, but in a relativistic theory it need not be. We consider boosting the 
system to a small velocity /3, where /3 lies in the plane of the orbits (Figj2]). The orbits 
remain circular to first order, since length contraction starts at 0((3 2 ), but they receive two 
types of first-order correction. 

First, the orbital velocities are altered due to the relativistic addition-of- velocity formula. 
This change does not, however, lead to any momentum discrepancy, since it corresponds to 
taking a snapshot of the particle momenta at a fixed time and then Lorentz-transforming 
them all. The resulting total 4- momentum is just the Lorentz transform of the original total 
4-momentum, with no extra discrepancy; in other words, this part gives rise to the analog 
of Eq. (jlj) for the orbiting particles. 

Transforming all the momenta in a fixed-time snapshot does not, however, give an accu- 
rate picture of the moving system, since the definition of fixed time differs between frames. 
Because the orbiting particles are continually accelerating, this difference can translate into 




(6) 
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an extra contribution to the total momentum. More concretely, the varying orbital velocities 
of the particles also imply a varying density, which creates net momentum. 

It is easiest to calculate the momentum discrepancy directly in terms of the internal force 
F(r) and the relativistic simultaneity change. The boosted trajectories are obtained, to first 
order, by substituting the coordinate change 

x — > a? — x — f3t 

t^t' = t- $ -x (7) 

into the rest-frame trajectories. The momentum discrepancy arises from the simultaneity 
shift, 

5t = -(3- x, (8) 

which effectively shifts the particles forwards or backwards in their trajectories by St, giving 
rise to the density shifts shown schematically in Figj2j During time 6t, a particle at location 
r changes its momentum by F(r)5t, hence the overall momentum discrepancy is 

5P = - 

charges 

In the last line, the equation has been written in more general form as an integral against 
the orbital charge distribution p, anticipating calculations to follow. 




FIG. 2. Orbiting particle system at rest (left), showing uniform density around the orbit, and 
boosted (right), showing varying density. Arrows at left indicate the orbital velocity. Symbols 
+, — at right indicate the sign of 5t, Eq.8. 
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For the circular orbits of the model, 5P is nonzero since both /3 ■ x and E are odd under 
reflection through the origin; hence, the orbital momentum develops a discrepancy, and the 
next step is to compare this to the EM momentum discrepancy, Eq.([6]). For this we first 
need to rewrite Eq.flH]) to include the full charge distribution, i.e. the nuclear charge as 
well, since Eq.([6]) includes the E- fields arising from all charges. We define p\,p 2 to be the 
orbital and nuclear charge densities, respectively, and E\,E 2 to be the respective electric 
fields resulting from these charges. We can then write 

5P = - J d 3 x ■ x){p x E 2 + p 2 E 1 ) (10) 

where the first term is just Eq.flHJ), while the second term vanishes since E\ is zero at the 
center of the orbit, where p 2 is concentrated. 

— * 

Since the B-field is negligible in the system at rest, the fields E^ are derived from scalar 
potentials, E t = — V0i, and one has also V 2 0« = — Inserting the scalar potentials and 
integrating by parts several times, one arrives at 

5P = J d 3 x (-${E X ■ E 2 ) + E x ■ E 2 ) + E 2 ■ E^ . (11) 

(This result is also shown through a slightly more general method in SecJVl) 

This may now be compared to the EM momentum discrepancy, Eq.flEJ), with E = E\ + 
E 2 , and one sees that the matter discrepancy cancels the "interaction" portion of the EM 
momentum discrepancy, i.e., the terms containing both Ei and E 2 ? It does not cancel the 
"self-energy" terms which are quadratic in E\ or E 2 alone; this makes sense because the self- 
energies are already implicitly included in the particle masses of the classical model. We note 
also that it is the interaction energy which is relevant for understanding mass differentials 
between different configurations of the same charges, e.g. a capacitor with varying plate 
separation. 

Eq. lfTTj) and its comparison to the EM momentum discrepancy are, intuitively, the main 
results, but the classical model is too far removed from the true atomic theory to be fully 
convincing. In addition to many other well-known deficiencies, one may doubt that the 
orbiting system would, in fact, attain the same state through physical acceleration which 
is found by Lorentz transformation, because the orbits are largely unconstrained and could 
change in many ways. This doubt is considerably reduced in the wave theory of atomic 
orbitals due to the discreteness of the solutions, since a slow acceleration will not typically 
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cause a jump between different solutions.— For these reasons, we recalculate the momentum 
discrepancy in the following section using the Dirac formalism. 

IV. MOMENTUM DISCREPANCY IN THE DIRAC THEORY 

For a Dirac particle of charge q coupled to an EM field ((f), A), the Hamiltonian and 
momentum operators are (see Appendix and ref. 16) 

H = a l pi + /3m + q<p 
f = -i(d { -iqA% (12) 

where a\ /3 are the standard matrices of Dirac is unrelated to the boost velocity /3 l ). 
We will consider just one Dirac field, and work in the single-particle formalism, ignoring the 
complications of multi-electron states and the additional field for the nucleus; these do not 
fundamentally change the calculation. 

Given a wavefunction ip which satisfies the Dirac equation, boosting the wavefunction 
means substituting the coordinate changes, Eq.(J7J), and applying a spinor rotation given by 
|(/3 • &).— Working for convenience at time t = 0, the only coordinate change is 5t = —f3 ■ x, 
the effect of which can be computed using the Hamiltonian: 

8^ = -0-x)d^ + ^0-6c)^ 

= i0-x)Hij) + -0-a)il). (13) 

The change in expected momentum is then given by 

5 (P*) = 5 (^J (fx^fipj = J d 3 x (5^p l i) + ^fdip + ^5p^j) . (14) 

We choose a gauge where A = in the rest frame (again assuming negligible B field), so the 
boosted state has A = f3(p (an additional gauge transformation could accompany the boost, 
but there is no reason to consider this). The change in A implies 5p l = —qcf)^ 1 , and we will 
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— * — * — * 

also need to use [H, (f3 ■ x)] = —i{3 ■ a, which cancels the spinor rotation terms. One finds 



d 3 x ^ 



-%H(f3-x) + -((5-a) 



p +p 



i(f3 ■ x)H + -((5 ■ a) 



- q<t>? V> 



d 3 x (-z(/3 ■ x)i/)i[H,&]i/> + i^[p\ (3 ■ x}Hif; - 

^^{H- qcf^J + / d 3 x q0 ■ x)&4nfiil) 

= ft (H - qcj)^ - j ' d 3 x(f3-x)(p)E\ (15) 

The first term is the expected (non-discrepancy) term, and contains the Dirac single-particle 
energy minus the EM interaction energy; this is the correct energy to use in conjunction 
with the EM expression Eq.(j2]), because the latter already includes all matter interaction 
energies (see Appendix). The second term is the momentum discrepancy, confirming the 
result found from the classical model, Eq.(J5]). 

This calculation, based as it is on a fixed background EM field ((f), A), still does not 
account for self-energies, which are again implicitly included in the particle masses. A full 
calculation would require quantum field theory and renormalization, but otherwise would 
follow the same manipulations leading to Eq. (fl5l) . and indeed would lead to the same result, 
at lowest order. 



V. THE EM MOMENTUM DISCREPANCY MORE GENERALLY 

The formula (Eqj6]) for the EM momentum discrepancy involved only the EM field, while 
the matter momentum discrepancy (e.g., Eqj9]) involved both the EM field and the charge 
density. The computation outlined above Eq. lflT!) shows that the two forms are, in fact, 
opposite to each other, but it is worthwhile to show this using a slightly more general 
method. 

The EM energy and momentum derive from the stress-energy tensor (see Appendix) 

T/ = —F lMX F va + - A 5^F^F a p (16) 

where F^ = d^A u — dyA^ is the EM field strength. In the presence of charges, is not 
conserved but rather satisfies 
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where J M is the total charge current, with p = J°. The EM momentum and energy Eqs. (l2ll3]) 
are given by 

n M = j d 3 xT \ 

Hem = J d 3 xT ° (18) 

and we wish to calculate the variation of V EM under the small boost Eq.([7j). This is given 
by the tensor index transformations plus the time change: 

&em = j d 3 x {/3 l T ° - j T/ - ■ x)d T 1 }. (19) 

In the last term we can substitute Eq. ffT7|) and integrate the resulting spatial derivatives by 
parts, cancelling the second term and arriving at 

5V EM = /3 1 Hem + j d 3 x ■ x)F ia J a (20) 

which, for negligible B field, and noting E % = F 01 , reduces to 

5V EM = P l U EM + J d 3 x ■ x) P E i (21) 

showing a discrepancy exactly opposite to the matter momentum discrepancy of Eqs. ()9|T5l) . 

VI. CONCLUSION 

The apparent discrepancies in back-reaction calculations involving classical charge distri- 
butions are thus accounted for by compensating discrepancies within the materials holding 
the charges. This is precisely the mechanism proposed by Poincare many years ago; our 
purpose has been to describe it more concretely in terms of the underlying matter theories 
which are now known to hold. 

More specifically, the external electric field which is sourced by the classical charges 
(e.g., charges on the capacitor plates) develops a momentum discrepancy Eq.(jHJ), which 
seems puzzling because the classical charges are fixed in place and cannot store additional 
momentum. However, Eq.f TTTj) shows that the full EM momentum discrepancy, both external 
and internal, is cancelled by the corresponding matter discrepancies, when all charges are 
included. The classical charges do not contribute to this because (noting EqEJ) the total 
E-field at their locations must vanish in order for the charges to be static. 
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Discrepancies of the "4/3" variety are seen to be inevitable when boosting a composite 
relativistic system, because the boost alters the simultaneity surface differently in different 
directions, changing the way momentum is divided between the subsystems. Another way 
to understand the discrepancies is to note that the back-reaction from one subsystem onto 
another arises from a local interaction, which has no way of accounting for the total energy 
contained in the extended region occupied by the subsystem. This was not an issue in 
Newtonian physics because interactions in that paradigm are not mediated by local fields, 
hence do not cause energy and momentum to be dispersed over an extended spatial region 
to begin with. 

A more formal way to see the orgin of discrepancies is to observe that the interaction La- 
grangian Cj typically contains no time derivatives (cf. Appendix), hence does not contribute 
to the momentum density Tq\ but the energy To° does contain — £/. The boosted momen- 
tum must then contain a term —f3Cj, and this can only be produced through discrepancies 
in the subsystem momenta. 

APPENDIX: DERIVATION OF ENERGY-MOMENTUM EXPRESSIONS 

For completeness in understanding Sections [IV] and |V] we show the derivation of funda- 
mental energy- and momentum-related quantities for the EM and Dirac fields. The action 
for an EM field coupled to fermions of charge q is given by C = Cem + where^ 

= -^Y{d^ - iqA^ - mW> (22) 
where F^ = dfj,A u — d u A^. The EM field satisfies the equation 

Q^U = _JU ( 23 ) 

where J v is the charge current iqip^ip. We note that in the mostly-plus metric convention 
one has ip = iip^ , giving J° = — qip^ ('y°) 2 ip = qip^ip, which is the expected form of the 
charge density. 

The matter equation of motion is 

7^ - iqA^ip + m^j = (24) 
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from which one derives 



d ip = 7V - iqAi)ip + m^ip + iqA°ip 

= -iHtfj (25) 

where the single-particle Dirac Hamiltonian H is as in Eq.( ll2p . and we note the mostly-plus 
definitions (3 = ry° and a 1 = — 7°7\ as well as <fr = A . 
The canonical stress-energy (SE) tensors are given by^ 

{TemT v = ~ 5 ^m 

= -F^d v A a - b\L EM (26) 

and 

= -frW - S\C,„. (27) 
The total energy and momentum of the system are then defined by 

n = j (fx {(T EM ) ° + (T^q } , 

T* = J d 3 x {(T^V + ^V}. (28) 

These overall quantities are gauge invariant since they are derived from a gauge-invariant 
Lagrangian, but the separate EM and matter tensors are not gauge invariant, and using 
{Tem)h" one does not obtain the familiar expressions Eqs.( |2ll3l) . To avoid this inconvenience 
one usually redefines the EM tensor to the gauge-invariant form 

(Tem)^ = —F^ a F ua — 8^ 'Cemi (29) 

which gives the normal expressions Eqs. (l2~f3"|) . This change corresponds to adding an extra 
term 

AT/ = F«d a A v (30) 

to the canonical EM tensor, and the same must then be subtracted from the matter tensor. 
Both SE tensors are then separately gauge invariant. We note that the overall SE tensor 

13 



is not symmetric, although the EM side becomes symmetric after the correction. The fully 
symmetrized SE tensor is obtained by adding a further total divergence to the matter SE 
tensor, which does not change the integrated energy and momentum. 1 - 

Hence, to verify that matter momentum discrepancies cancel those generated by the 
standard EM expressions Eqs. (T2"f3"j) . one must use energy and momentum derived from the 
matter SE tensor 

= (T^)/ - Ffd a A" (31) 

which looks strange, but gives a reasonable result because the added term can be integrated 
by parts and then converted to a ip term using Eq. (1231) . In this way one finds energy 

= J d 3 x {(7» ° - F a d a A } 
= J d 3 x {{T^ + J A } 

= J d 3 x i) ] (H-q<p)i) (32) 

showing the correctness of the energy used in Eq. (1151) . For the Dirac field momentum one 
finds similarly that the added SE term converts derivatives <9j to the gauge-invariant form 
di — iqAi, justifying the gauged momentum operator given in Eq. (fi"2"j) . We note lastly that 
the SE tensor alteration Eq. (1301) does not affect the momentum discrepancies in the two 
subsystems, as this term by itself has no discrepancy. 
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